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Abstract 


Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV) caused a severe outbreak in several regions of the world in 
2003. The virus is a novel coronavirus isolated from patients exhibiting atypical pneumonia and may have originated from wild 
animals such as civet cats in southern China. The genome of SARS-CoV is a positive-sense, single-stranded RNA whose sequence is 
distantly related to all known coronaviruses that infect humans and animals. Like other known coronaviruses, SARS-CoV is an 
enveloped virus containing three outer structural proteins, namely the membrane (M), envelope (E), and spike (S) proteins. The 
nucleocapsid (N) protein together with the viral RNA genome presumably form a helical core located within the viral envelope. The 
SARS-CoV nucleocapsid (N) protein is a 423 amino-acid, predicted phospho-protein of 46kDa that shares little homology with 
other members of the coronavirus family. A short serine-rich stretch, and a putative bipartite nuclear localization signal are unique 
to it, thus suggesting its involvement in many important functions during the viral life cycle. In this report we have cloned the N gene 
of the SARS coronavirus, and studied its property of self-association to form dimers. We expressed the N protein as a fusion protein 
in the yeast two-hybrid system to demonstrate self-association and confirmed dimerization of the N protein from mammalian cell 
lysates by coimmunoprecipitation. Furthermore, via deletion analysis, we have shown that the C-terminal 209 amino-acid region 
constitutes the interaction domain responsible for self-association of the N protein to form dimers. 
© 2004 Elsevier Inc. All rights reserved. 
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Severe acute respiratory syndrome (SARS) is a newly 
emerging infectious disease. The causative agent of 
SARS has been identified as a novel coronavirus, 
namely SARS-associated coronavirus (SARS-CoV) 
[1-3]. As of 30 June 2003, 8447 probable SARS cases 
including 811 deaths have been reported to the World 
Health Organization (WHO) from 32 countries or 
regions worldwide. 

The SARS viral genome comprises approximately 
30,000 nucleotides, which are organized into approxi- 
mately 13-15 open reading frames (ORFs), taking into 
consideration only those exceeding 50 amino acids in 
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translational capacity [4,5]. Sequence comparison with 
corresponding ORFs of other known coronaviruses re- 
veals a similar pattern of gene organization typical of 
coronaviruses [6]. 

The 1259-nucleotide N gene of the SARS virus re- 
sides at the 3’ end of the genome. This 46-kDa structural 
protein has been predicted to interact with the viral 
RNA, and both constitute the viral nucleocapsid. The N 
protein of coronaviruses is present in the icosahedral 
core as well as in the internal helical nucleocapsid [7]. 
Since no studies have hitherto been reported on the 
structure and function of the N protein of the SARS 
coronavirus, we PCR amplified and cloned the N pro- 
tein of SARS-CoV, and studied its capability to form a 
dimer by self-association using the yeast two-hybrid 
system. Here we report that the N protein of the SARS 
coronavirus is capable of self-association and requires 
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the C-terminal helix-rich 209 amino-acid region to 
facilitate this protein-protein interaction. 


Materials and methods 


Growth media, yeast strains, and plasmid constructs. All strains, 
plasmids, and plasmid constructs used in this study are described in 
Table 1. The full-length N gene of the SARS coronavirus (Singapore 
isolate) was PCR-amplified from a genomic construct of clone 
NC_004718, and cloned into the pCR-XL-TOPO vector (Invitrogen). 
The full-length N gene was subjected to DNA sequencing, and the 
inserts were verified against the corresponding region of the SARS 
coronavirus complete genome NC_004718. The full-length N gene was 
excised from the pCR-XL-TOPO-N construct using the restriction 
enzymes EcoRI and Clal, and ligated into the pGADT7 vector to 
generate an N-terminal in-frame fusion with the GAL4 activation 
domain (AD). To clone N in fusion with the DNA-binding domain of 
GAL4, pGADT7-N was digested with NcoI and PstI, and the excised 
fragment was ligated into the pAS2 vector which was linearized with 
the same pair of restriction enzymes. HA tagged N was cloned into 
pSGI by digesting pGADT7-N with Bg/II and ligating into pSGI di- 
gested with the same enzyme. Myc-tagged N was produced by ligating 
the BamHI and Apal fragment from pCR-XL-TOPO-N into pcDNA 
3.1 vector digested with the same enzymes. All DNA manipulations 
were performed as described by Sambrook et al. [8]. All deletion 
constructs were generated by subcloning the full-length N gene as 
summarized in Table 1. All constructs were verified by restriction 
digestion and sequencing. 

Yeast two-hybrid techniques. The GAL4-based two-hybrid system, 
kindly provided by Dr. Stephen Elledge [6], contain pAS2 (DNA 
binding domain vector) and pACT2 (activation domain vector), to- 
gether with the yeast reporter strain Saccharomyces cerevisiae AH109 
(Table 1) were employed. The host strain containing pAS2-SNF1 and 
pACT2-SNF4 was used as a positive control [9]. The AH109 host 
contains integrated copies of both H/S3 and lacZ reporter genes under 
the control of GAL4 binding sites. The AH109 yeast strain was 
transformed with the appropriate plasmids using the lithium acetate 
procedure, and grown on SD plates in the absence of Trp and Leu 
(SDTrp~ and SDLew_, respectively). Protein interaction was tested on 
SD plates without Leu, Trp, and His (SDLeu Trp“ His”). After in- 
cubation 30°C for 3 days, individual colonies were streaked out and 
tested for liquid and filter-lift B-galactosidase activity, by 50mM oft 
3-amino-1,2,3-triazole (3AT) assay, and by the diploid His assay. The 
filter B-galactosidase assay, a parameter directly reflecting the strength 
of protein-protein interactions, was performed by streaking doubly 


transformed yeast colonies onto filter paper and allowing them to grow 
for 2 days on selection medium. Yeast cells were permeabilized by 
freezing yeast-impregnated filters in liquid nitrogen and thawing at 
room temperature. The filter was placed over a second filter that was 
pre-soaked in 0.1 M phosphate buffer (pH 7.0) containing 300 mg/ml of 
5-bromo-4-chloro-3-indolyl-B-p-galactopyranoside (CPRG) and 0.27% 
B-mercaptoethanol. Filters were left for 48h to develop a blue color 
which indicated a positive protein-protein interaction. The liquid 
B-galactosidase activity was determined using the substrate CPRG as 
described previously [10,11]. Relative enzymatic activity was deter- 
mined for five independent transformants. Data for quantitative assays 
were collected for yeast cell number and are means + SEM of triplicate 
assays. Appropriate positive and negative controls, and buffer blanks 
were used. The AH109 host strain containing pAS2-SNF1 and 
pACT2-SNF4 was used as a positive control [9]. 

Coimmunoprecipitation assay and Western blotting. COS-1 cells were 
transfected with the pSGI-N or pCDNA3.1-N plasmid or both using 
the lipofectine reagent. At 48h post-transfection, cells were washed 
once in PBS and then lysed in lysis buffer (20 mM Tris (pH 7.5), 150 mM 
NaCl, 1 mM EDTA, |mM EGTA, 1% Triton X-100, 2.5mM sodium 
pyrophosphate, 1mM f-glycerol-PO,, |1mM Na3SO,), with protease 
inhibitor cocktail. Equal amounts of protein were incubated with re- 
spective antibodies overnight at 4°C. To the samples was added 100 pl 
of 10% suspension of protein A-Sepharose. The mixture was allowed to 
shake for 1h at 4°C following which the beads were washed four times 
in lysis buffer, and protein was eluted in 2x SDS dye by boiling the 
sample for Smin. Samples were resolved by 10% SDS-PAGE and 
transferred to a nitrocellulose membrane. The membrane was blocked 
using 0.5% non-fat dried milk in PBST for | h, and incubated overnight 
with primary antibody. The blot was then washed thrice in PBST, in- 
cubated with anti-mouse IgG HRPO for 1h and washed thrice and 
bands were detected using the ECL detection method as recommended 
by the manufacturer (Cell Signalling Technology). 


Results and discussion 
The full-length N protein can form a dimer 


The nucleocapsid (N) protein of the SARS corona- 
virus (Singapore isolate) was cloned into the yeast 
two-hybrid vectors (Table 1) resulting in an N-terminal 
in-frame fusion of the GAL4-DNA binding domain and 
the GAL4 activation domain to the N protein. In order 
to confirm the reading frame and expression of the N 


Table | 

Yeast strains, plasmids, and recombinant plasmid constructs used in this study 
Strain/plasmid/construct Genotype/description 
Strain 
AH109 


LexA-MS2 coat (TRP1). 
Constructs 
pGADT7 N 
pAS2-1 N 
1-310 N 
1-630 N 
1-830 N 
310-1260 N 
630-1260 N 
pSGI HA N 
pCDNA 3.1 myc N 


MATa, ura3-52 leu2-3,112, his3d200, trpld1, ade2, LYS2::LexA op)-HIS3, ura3::LexA-op)-LacZ, 


pCR-XL-TOPO-N cut withEcoRI and Clal and cloned into pGADT7 cut with the same. 

pGADT7 N cut with NcoI and PstI and cloned into pAS2-1 cut with the same. 

pGADT7 N cut with SacI, klenowed and vector self-ligated. 

pGADT7 N was cut with EcoRI and Nhel and fragment cloned into pGADT7 cut with EcoRI and Smal 
pGADT7 N cut with 7tHIU, fragment cloned into pGADT7 cut with EcoRI and Smal 

pGADT7 N cut with SacI and Cial, fragment cloned into pGADT7 cut with Smal and Clal. 

pGADT7 N cut with Nhel and Clal, fragment cloned into pGADT7 cut with Smal and Clal 

pGADT7 N cut with Bg/II and BamHI and cloned into pSGI cut with Bg/II 

pCR-XL-TOPO-N cut with BamHI and Apal and cloned into pcDNA3.1 myc cut with the same. 
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protein, it was expressed in vitro using the rabbit retic- 
ulocyte-based, coupled transcription—translation system 
(Promega) and immunoprecipitated using anti-HA an- 
tibody (data not shown). S. cerevisiae AH109 (MATa 
trp1-901 his3 leu2-3, 112 ura3-52 ade2 gal4 gal80UR- 
A3::GAL-lacZ LYS2::GAL-HIS3) cells were trans- 
formed with single plasmids, or cotransformed with the 
GAL4 BD- and AD-vectors containing SARS-CoV N. 
The AH109 host strain containing pAS2-SNF1 and 
pACT2-SNF4 was used as positive controls [9]. AH109 
contains integrated copies of both HIS3 and /acZ re- 
porter genes under the control of GAL4-binding sites. 
The results of the two-hybrid assay are shown in Fig. 1. 
Single transformants used in this assay were AH109 
yeast cells containing BD-N, and cells containing AD- 
N. Yeast host cells containing only BD- and only AD- 
vectors, were also used as negative controls (Fig. 1). All 
these transformants grew on the YPD non-selective 
media. The untransformed host cells were also plated as 
negative controls. Single transformants containing the 
BD-vector alone or as a fusion with N showed growth 
on the synthetic dextrose Trp” plate (SDTrp~). Corre- 
spondingly, single transformants containing the AD- 
vector alone or as a fusion with N showed growth on the 
synthetic dextrose Leu~ plate (SDLeu~). The cotrans- 
formants were similarly plated on YPD and synthetic 


AD- 


: 


BD-N 


AD-N / BD-N 
Clone # 1 


AD-N / BD-N 
Clone #2 


AD-1111 / BD-1112 


- oa 
—~ Ge 


dextrose medium lacking Trp or Leu or both, and the 
resultant colonies were subsequently plated on His~ 
medium (SDTrp- Leu“ His_ ) to test for His prototrophy. 
Growth of the cotransformants, containing the BD-N 
and AD-N constructs, in both SDTrp~ and SDLeu™ 
plates simply showed that both plasmids were present in 
the transformed cells. Growth of these clones on the 
SDTrp~- Leu” His~ media showed that the transcription 
of the HIS3 gene was switched on by the reconstitution 
of the GAL4 transactivator due to a specific N—N in- 
teraction. Colonies were transferred onto nitrocellulose 
filters and a B-galactosidase filter assay was performed 
[12-14]. The cotransformants containing both the BD-N 
and AD-N constructs together with all positive and 
negative controls were tested by this assay. Results ob- 
tained from the f-galactosidase filter assay were in 
agreement with the results obtained from the SDHis~ 
growth experiments. 

The liquid B-galactosidase assay was conducted and 
activity was determined with the substrate chlorophenol 
red-B-p-galactopyranoside (CPRG) as described previ- 
ously [15,16]. The host strain AH109 alone, together 
with single transformants containing the AD-N and 
BD-N and cotransformants containing AD-/BD- with- 
out a fusion protein, BD-/AD- and single transformants 
with either BD-N or AD-N were tested. Negative con- 


Plasmid Constructs -| SDTrp- | SDTrp ‘sotie| pc 
Leu- 


Fig. 1. Yeast two-hybrid results showing full-length homotypic interactions of the N protein of SARS-CoV. YPD yeast peptone dextrose media 
(no selection), SDTrp~, SDLeu~, SDTrp" Leu’, and SDHis™ are restrictive growth media lacking Trp, Leu, Trp Leu, and Trp Leu His, respectively. 


B-Galactosidase filter assay results are also shown. 
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trols (host untransformed cells) showed virtually no 
liquid B-galactosidase activity. BD-1111/AD-11112 was 
the positive control whereas the clones containing BD- 
N/AD-N were the test samples in this experiment 
(Fig. 2). Relative enzymatic activity was determined in 
five independent transformants from each group. Our 
results from this assay indicate a strong protein-protein 
interaction between AD-N and BD-N proteins. 

To further confirm these interactions in a mammalian 
cell environment, we performed a coimmunoprecipita- 
tion assay using N protein fused with two different tags. 
COS-1 cells were transfected with amino-terminal HA- 
tagged N or carboxy-terminal myc-tagged N or both. 
Lysates were subsequently immunoprecipitated using 
respective antibodies and immunoblotted using HA 
antibody. Lane 2 in Fig. 3 shows a cotransfected sample 
immunoprecipitated using anti-myc antibody. Lane 1 
shows the HA-tagged N-transfected sample immuno- 
precipitated using anti-HA antibody which served as a 
positive control. Lane 3 shows myc-tagged N-trans- 
fected sample immunoprecipitated with anti-HA anti- 
body which served as a negative control. Lane 4 shows 
myc-tagged N immunoprecipitated using myc antibody. 
These results clearly demonstrate that the N protein is 
capable of homodimerization in a mammalian cellular 
environment. In order to form the capsid of SARS-CoV, 
the nucleocapsid protein has to oligomerize. Hence our 


1 2 3 4 


Fig. 3. COS1 cells were transfected with HA-N,MYC-Nor both, im- 
munoprecipitated using respective antibodies, resolved by 10% SDS— 
PAGE and Western blotted using anti-HA antibody as described in 
Materials and methods. Lane | shows HA-N immunoprecipitated with 
the same antibody, lanes 2 and 3 show cotransfected cells immuno- 
precipitated using anti-myc or anti-HA antibody, respectively, lane 4 
shows myc-N immunoprecipitated using anti-myc antibody. 


data provide a functional verification to earlier predic- 
tions regarding the N protein of the SARS virus. Fur- 
ther studies on the molecular dissection of N were 
carried out to map the region of N responsible for this 
interaction. 
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Fig. 2. Liquid B-galactosidase assay results. Single and cotransformants were analyzed for a liquid B-galactosidase assay and were compared with 
others. Values are given in arbitrary units. The numbers above each bar represent the mean of five independent transformants. 
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A 210 amino-acid carboxy-terminal region is responsible 
for N dimerization 


Various deletions were designed to characterize the 
homodimerization domain of the N protein. Thus, plas- 
mids pGADT7 N 1-310, pGADT7 N 1-630, pGADT7 N 
1-830, pGADT7 N 310-1260, and pGADT7 N 630-1260 
were constructed as described in Table 1. We investi- 
gated the strength of all these interactions for the full- 
length and truncated N proteins by measuring the HIS3 
reporter gene in the presence of 0, 5, 10, 20, 25, and 
50mM of 3-aminotriazole (AT). These results indicated 
the strength of the protein-protein interaction as a 
function of His prototrophy. Cells containing both fu- 
sions of full-length N proteins showed growth on SD 
His” 50mM AT plates, thereby confirming strong ho- 
modimerization. 

Our two-hybrid experiments each consisted of co- 
transformation of one of the pGADT7 N deletions with 
the full-length fusion protein pGADT7 N. The results of 
these experiments are shown in Fig. 4. The AD 1-830, 
AD 1-630, and AD 1-310 constructs cotransformed with 
the full-length N construct (BD-N) lacked reporter gene 
activation. In contrast, the AD-N 310-1260 and AD-N 
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630-1260 constructs when tested with full-length BD-N, 
revealed reporter gene activity. The AD-N 310-1260 and 
AD-N 630-1260 when tested with BD-N (full-length) 
showed reporter gene activity. However, the AD 1-830, 
AD 1-630, and AD 1-310, when cotransformed with 
their corresponding full-length N construct (BD-N), 
showed no reporter gene activation. However, the 
strength of interaction between 630-1260 and BD-N was 
found to be lower than that of full-length N-N or AD N 
310-1260 and full length N interaction. Hence, we 
mapped the interaction domain for the N protein di- 
merization to reside in C terminal one-third amino ac- 
ids. Upon investigation of the predicted secondary 
structure model for N, we found this region to be helix- 
rich and contained the putative nuclear localization 
signal. 

Self-association of the nucleocapsid protein has been 
observed in many viruses. This process leads to the 
formation of a viral capsid which protects the genome 
from extracellular agents. In this report, we have used 
two independent assays to establish a direct interaction 
between the nucleocapsid molecules. This dimerization 
process is likely to lead to multimerization of the N 
protein and to subsequent capsid assembly. While the 
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Fig. 4. Deletion analysis for isolation of the interaction domain of the N protein of SARS-CoV. Using the yeast two-hybrid approach, each of the 
shown deletions was tested along with a full-length N protein and scored for marker gene activity. Histidine prototrophy was studied at 5, 10, 20, 25, 
and 50mM AT concentrations on SDHis~ media. Liquid B-galactosidase assay was conducted on the cotransformants and mean of five samples is 
displayed for each deletion of the N protein. The deletions divide the N protein into four regions A, B, C, and D. Numbers above each deletion bar 


depict amino-acids. 
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functional relevance of this interaction in an in vivo 
context is unknown, our report nonetheless clearly 
proves a direct physical interaction between the SARS N 
protein molecules. Moreover, the strong capability to 
self-associate was demonstrated by the N—N cotrans- 
formed yeast cells which exhibited the ability to grow at 
50mM concentration of 3AT in the yeast two-hybrid 
assay. However, the exact mechanism of interaction is 
not known. In a wide range of viruses disulfide bond 
mediated self-association of nucleocapsid has been ob- 
served [17-20], but the N protein of SARS is unable to 
do so since it does not have any cysteine as observed 
from its amino acid composition. Observations have 
been gathered from N proteins of other RNA virus ge- 
nomes (e.g., equine arthritis virus and simian hemor- 
rhagic fever virus). The disulfide bond in the 
nucleocapsid protein has been proposed to enhance the 
stability of virions: e.g., hepatitis B virus [18] and human 
papilloma virus [19]. Interestingly, N proteins of other 
corona Viruses contain cysteine residues, e.g., MHV N 
has 2 cysteine residues. Although it is not known whe- 
ther cysteine plays a role in self-association of nucleo- 
capsid in other corona viruses, the complete absence of 
cysteine residues in the N protein of SARS implies that 
it is unique among other corona viruses. It is quite 
possible that the SARS virus may have adopted an al- 
ternative conformational strategy to circumvent the 
problem of missing cysteine. In an in silico model for 
secondary predictions, both N proteins of MHV and 
SARS show a hydrophilic, helix rich C-terminal. This 
distinct secondary structure may be responsible for tight 
intramolecular self-association; however, the exact 
mechanism and forces driving the described self-associ- 
ation of the nucleocapsid proteins need further investi- 
gation. Further, since the N protein of SARS has been 
predicted to play other regulatory functions [21] besides 
capsid assembly, dimerization could be an activating 
switch for regulation of protein function. Moreover, in 
MHV, the RNA binding domain has been found to 
reside in similar locations between amino acids 163-229 
[20], which includes the C-terminal helix-rich region. We 
have identified the dimerization domain of the N protein 
of SARS to be localized to the C-terminal helix rich 
region thus suggesting a similar strategy by the SARS 
virus for nucleocapsid formation. 
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